Synchronisation of secure audiovisual streams

ABSTRACT

This invention relates to a device capable of transmitting in a secured manner a set of films of high visual quality to a display screen of the TV screen type and/or for being recorded on the hard disk or on any other recording support of a box/connector connecting the teletransmission/telecommunication network to the display screen such as a TV screen or a personal computer monitor while preserving the audiovisual quality, but avoiding fraudulent use such as the possibility of making pirated copies of films or audiovisual programs recorded on the hard disk or on any other recording support of the set-tope boxy/decoder boxy. The invention also concerns a client-server system and the synchronization mechanism between the server supplying the stream permitting the viewing of the secured digital video film and the client who reads and displays the digital video stream.

The present invention relates to the area of the processing of digital video streams.

The present invention proposes supplying a system that permits the recomposing of a digital video content that was previously visually scrambled.

The present invention relates more particularly to a device capable of transmitting in a secured manner a set of films of high visual quality to a display screen of the TV screen type and/or for being recorded on the hard disk or on any other recording support of a box [connector] connecting the teletransmission [telecommunication] network to the display screen such as a TV screen or a personal computer monitor while preserving the audiovisual quality but avoiding any fraudulent use such as the possibility of making pirated copies of films or audiovisual programs recorded on the hard disk or on any other recording support of the set-top box [decoder box]. The invention concerns a client-server system and the synchronization mechanism between the server supplying the stream permitting the viewing of the secured digital video film and the client who reads and displays the digital video stream.

It is possible with the current solutions to transmit audiovisual films and programs in digital form via broadcasting networks of the radio [wireless], cable, satellite, etc. type or via telecommunication networks of the DSL (Digital Subscriber Line) type or BLR (local radio loop) or via DAB (Digital Audio Broadcasting) networks, etc. Moreover, in order to avoid the pirating of works broadcast in this manner, these works are frequently encrypted or scrambled by various means well-known to the expert in the art.

However, the main disadvantage of all the current solutions (TiVO Inc., WO00165762) is that it is necessary to transmit not only the encrypted data to the users but also the decryption keys. The transmission of the decryption keys can be made before, at the same time as or after the transmission of the audiovisual programs. In order to increase security and therefore the protection of audiovisual works against an ill-intended use, the decryption keys as well as the decryption functions of the audiovisual decoders can comprise improved security means such as smart cards or other physical keys that can optionally be remotely updated.

Thus, the current solutions applied to a set-top box with the possibility of the local recording of audiovisual programs in digital form on any support of the hard disk type or some other type of memory offers an ill-intentioned user the possibility of making non-authorized copies of the programs recorded in this manner, since at a given moment this user possesses with his digital set-top box, whether associated or not associated with smart card systems, all the information, software programs and data permitting a complete decryption of the audiovisual programs. Precisely by virtue of the fact that he possesses all the data, the ill-intentioned user will have the possibility of making illegal copies without anyone perceiving this fraudulent copy at the moment at which it is made.

One solution could therefore consist in transmitting all or part of a digital audiovisual program solely on demand (video services on demand) through a broadband telecommunication network of the ADSL, cable or satellite type without authorizing the local recording of the audiovisual programs. The disadvantage here is quite different and stems from the performances of these networks, that do not allow continuous streams of several megabits per second to be guaranteed to each user, as required by MPEG streams, that require bandwidths of several hundred kilobits to several megabits per second.

Under these conditions, one solution consists in separating the streams into two parts of which one can not be used by itself. Several patents have been filed to this end. Thus, document WO09908428 (Gilles Maton) teaches a process for the multi-application processing of an active, localizable terminal in which at least one connection is made to an identifiable program dedicated to carrying out an application, which program dictates its conditions of use to the terminal for making the functions available. The terminal communicates on an ad hoc basis by using a connection with the management center for the realization, if necessary, of the inputs and outputs of the capacities of the latter, which management center becomes a slave or does not become a slave of the terminal at the level of the application software relative to the entering program. This invention also concerns the process of identifying the program and the terminal in use. This process of the prior art divides the stream into one part serving to identify the user and into one part containing the program properly speaking. In particular, this program is not unusable but only locked by the first part. Finally, this patent does not present any solution for synchronizing these parts.

On the other hand, document EP 0778513 (Matsushita) describes a process permitting the prevention of the illegal use of information by adding control information in order to verify the rights of the user. The system permits the permanent knowledge of which part of the information is being used and by which user and consequently to know if this user is illegal or not. This process thus secures the data by adding additional information that denatures the initial information.

Document WO0049483 (Netquartz) also offers us processes and systems for creating a connection between the users and an editor of digital entities. The process comprises at least one of the following steps: The step of subdividing this digital entity into two parts; the step of memorizing one part in a memory zone of the server connected to a computer network; the step of transmitting the other part to at least one user with computer equipment; the step of connecting this computer equipment to this computer network; the step of establishing a functional connection between this first part and this second part. These processes and systems do not specify on the one hand if the memorized part on the server can be stored by the user, which would permit him to pirate this digital entity, and on the other hand do not specify the manner of synchronizing these two parts.

Finally, in this approach the state of the closest art is found in the patents of HyperLOCK Technologies, of which the most pertinent one is document U.S. Pat. No. 05,937,164. This invention uses a solution consisting in separating the stream into two parts of which the smallest one contains information necessary for using the largest one. However, this patent is not sufficient for responding to the problem identified. In fact, the suppression of one part of the stream denatures the format of the stream and it can therefore not be recognized as a standard stream that can be used with general software applications. This process of the prior art simultaneously necessitates a specific software on the server side for the separation of the two parts and another specific software that assures not only the reconstruction of the stream but also the acquisition of the main stream and its use according to a format proprietary for the solution. This proprietary format is not the initial format of the stream before the separation into two parts in this known solution.

This company also filed three other patents: Document U.S. Pat. No. 5,892,825 resumes the preceding patent but in a narrower scope because the streams in it are always encrypted; document U.S. Pat. No. 6,035,329 is based on the same principle and concerns a process permitting the reading of a disk of the CE-ROM or DVD-ROM type conditioned by the identification of the rights by inserting a smart card on which the information necessary for reading is stored. This process is still not sufficient for our problem because it does not guarantee that the modified stream has the same format as the original stream. Finally, document U.S. Pat. No. 6,185,306 concerns a process for transmitting encrypted data from a web site to a requesting computer. However, this process allows the user to have all the tools necessary for copying the data at a given moment.

Another reference of the prior art is document WO 00/44172, that presents a system of video distribution on demand consisting of transmitting encrypted video streams from a supplier to one or several receivers. The encrypted video is stored in advance in the receiver and displayed at a later date. The display request is addressed to a video sender that sends the decryption information for an immediate display of the video stream. The decryption information is sent to the receiver via a separate path or via the same path as the encrypted video. Before sending the keys (static or dynamic) for the decryption a step for the identification of the receiver is carried out. This prior art therefore describes a system of encryption with the aid of a key or keys that is well known to an expert in the art. However, the entire video stream protected by encryption is stored in the receiver and the data set of the video stream is located inside the protected video and is therefore vulnerable to being pirated.

The prior art also contains document US 2002/0164024 A1, that concerns a system for the processing of video and audio data based on a relation of prediction between frames, containing a module for the fragmentation of the data, a module for the encryption of part of the data, a sending module, a module for differentiating the processing by type of frames I, P, B and their classification in separate files, a receiving module, a decryption module and a module for the assembling of data. The encryption process is carried out solely on fragments comprising images I, that are decrypted and reassembled in the receiver, thus reconstituting the video, which reconstitution is carried out by aligning the packets in increasing order of the time reference indicated in the binary stream. This document describes a system of “classic” encryption with optimization of the encryption. The fragmentation is applied in order to separate the I images from the rest of the stream in order to encrypt them. After this selective encryption the fragments are sent using different queues. However, all the data of the audiovisual stream remain inside the protected stream with all or part of the stream not being encrypted.

Finally, patent WO 01/97520 also teaches methods, processes and devices for controlling the transmission and recording of digital content of the MPEG-2 type. However, this patent does not teach any specificity for the synchronization of the streams constituting the two parts of one and the same audiovisual program. Moreover, the method described in this patent is totally ineffective for low bandwidth telecommunication networks because it substitutes all or part of the I images, whose weight in bytes is very expensive during the transmission of the second stream.

In order to correct these various deficiencies, the invention relates in its most general meaning to a process for the distribution of video sequences in accordance with a nominal stream format constituted by a succession of images (“pictures”), which nominal stream, on which an analysis is made prior to the transmission to the client equipment for generating a modified main stream, has the format of the nominal stream and has images modified by the substitution of certain data by data of the same nature but random or calculated, and has complementary information of any format comprising substituted data and digital information suitable for permitting the reconstruction of this modified nominal stream, and then for separately transmitting the modified main stream in real time or deferred time and the complementary information in real time at the moment of display from the server to the recipient equipment, and for which a synthesis of an available stream in a nominal format is calculated on the recipient equipment, reconstituted as a function of said modified main stream and of said complementary information and a reading of this available stream on the recipient equipment, characterized in that this process comprises during this reading of this stream a stage consisting of generating a position identifier as a function of the characteristics of this read stream, which position identifier is transmitted to the server that activates in response the sending of the complementary information as a function of this position identifier.

According to a first variant each image of the nominal stream is associated with a position indicator.

According to a second variant the reading stage comprises an operation for calculating the position identifier of the image read.

According to a third variant a reading stage comprises an operation for calculating the position identifier of the stream read.

The complementary information is advantageously sent in portions.

According to a particular embodiment of the invention the available stream on the recipient equipment of which the reading conditions the position and the portion to be sent of the complementary information is a part of the modified main stream.

According to another embodiment the available stream on the recipient equipment of which the reading conditions the position and the portion to be sent of the complementary information is a part of the reconstituted main stream.

In a particular embodiment of the invention the nominal stream format is defined by the MPEG-2 standard. In this particular embodiment said position identifier for an image is constituted by the time code variable associated with the group of images in which the image under consideration is located and the temporal reference variable for the image, that are variables defined by MPEG-2.

Each portion of this complementary information sent by the server advantageously permits the reconstitution of at least one image of the original stream during said synthesis.

According to a particular embodiment the server adapts the size and the content of each portion of this complementary information to be sent as a function of said position identifier.

In a preferred embodiment each portion of this complementary information is sent in advance relative to the instant of display of this image of the stream reconstituted with this portion.

In one embodiment the sender adapts the sending of complementary information when the user of the recipient equipment makes a pause, stopping the transmission of complementary information. Likewise, the server adapts the sending of complementary information when the user of the recipient equipment makes a rapid advance or a rapid return by sending the portion corresponding to the proper position for the commands “rapid advance” and “rapid return”. In the same manner, the server adapts the transmission of complementary information when a network breakdown occurs that prevents the client-server communication by stopping the transmission of complementary information during the breakdown and restarting it when the breakdown stops and it again receives the messages coming from the client.

Prior to the transmitting of the complementary information the server advantageously creates a table associating the pointers to the portions of the complementary information with the temporal [time-division] positions relative to the images of the video stream, stores this table on a support connected to the server and consults this table in order to determine the portion of complementary information to be transmitted after having received said position identifier.

The invention also relates to equipment for the production of a video stream for implementing the process as described above, comprising at least one multimedia server containing the original video sequences, a device for analyzing the video stream coming from this server for generating said modified main stream and said complementary information and which comprises in particular a device for synchronizing the transmission of this complementary information as a function of said position identifier transmitted by the recipient equipment.

Finally, the invention relates to a system for the transmission of a video stream in accordance with the above-described process, comprising equipment for producing a video stream, at least one piece of equipment for making use of a video stream and at least one communication network between the production equipment and the piece or pieces of equipment for said making use [exploitation].

The present invention will be better understood from a reading of the following description of a non-limiting exemplary embodiment that makes reference to the attached drawings.

FIG. 1 describes the architecture of an entire system for implementing the process of the invention.

FIG. 2 shows a particular embodiment of the system for synchronizing audiovisual streams in conformity with the invention.

The general principle of a process for securing a video stream is disclosed below. The invention concerns a process for the distribution of digital video sequences according to a nominal stream format constituted by a succession of pictures (or the MPEG format), each of which comprises at least one digital block regrouping a certain number of coefficients corresponding to simple video elements coded digitally in accordance with a manner specified within the stream concerned, and used for all the video decoders capable of displaying it in order to be able to decode it correctly, and possibly organized in a hierarchical manner in a group of pictures and sequences.

The format advantageously contains means for localizing this digital picture in the stream with the aid of a temporal position identifier allowing the knowledge of at what moment the picture in question is to be displayed.

This process comprises:

-   -   A preparatory stage consisting of modifying at least one of said         elements of one of said pictures. The binary stream generated in         this manner is called the modified main stream;     -   A transmission stage:         -   of the modified main stream in conformity with the format of             the nominal stream, constituted by pictures containing the             elements modified in the course of the preparatory stage,             and         -   Complementary digital information via a path separate from             said modified main stream that permits the reconstitution of             the original stream from the calculation on the recipient             equipment as a function of this modified main stream and of             this complementary information. This complementary             information is defined as a set constituted by data (e.g.,             elements describing the original digital stream or extracts             of the original stream) and by functions (e.g., the             substitution or swapping [permutation] function). A function             is defined as containing at least one instruction relating             data and operators. This complementary information describes             the operations to be carried out in order to recover the             original stream from the modified main stream.

The reconstitution of the original stream is carried out on the recipient equipment from the modified main stream already present in the recipient equipment or transmitted in real time and from the complementary information transmitted in real time at the moment of display comprising data and functions carried out with the aid of digital routines (set of instructions). The complementary information is transmitted by the server as a function of the position of the reading head in the stream available on the client equipment. This position, defined by a position identifier or synchronization element is transmitted regularly by the client to the severs in accordance with the instant of display by the client of the stream available on the client equipment. This available stream on the client equipment is identical to the original stream if the client has the right to display it and if he has a connection to the server containing the information complementary to or identical to the modified main stream if this is not the case.

When the available stream on the recipient equipment is identical to the original stream the user can view it on his screen and if the user wishes to move in the video sequence the client informs the server of the new position of the reading head and the server then transmits the necessary complimentary information to the client for reconstituting the portion of the film that he is now to view.

In the present invention the term “scrambling” denotes the modification of a digital video stream with suitable methods in such a manner that this stream remains in conformity with the standard with which it was digitally encoded while rendering it playable by a visual display but altered from the viewpoint of human visual perception.

In the present invention the term “descrambling” denotes the restitution process with appropriate methods of the initial stream, which video stream restored after the scrambling is identical to the initial video stream.

The present invention proposes a protection by means of a scrambling of the video stream integrally based on its structure, which protection consists in modifying targeted parts of the bitstream (structured binary stream) essential for the comprehension of the film by the human eye. The true values of these targeted parts are extracted from the bitstream and stored as complementary information and random or calculated values or swapped values are placed in their place, which is done for the entire video stream. Thus, “decoys” are added for the set-top box, that is to say, values comprehensible to the set-top box but not identical to those presented in the nominal stream, which receives a video stream at the input completely conforming to the original video format but which is not acceptable from the viewpoint of visual perception by a human being.

Inversely to the majority of encryption systems already known by an expert in the art, the principle described below permits a high level of protection to be ensured without necessitating a high bandwidth client-server connection since said complementary information to be transmitted only represents a low percentage of the original video stream and the modified main stream is already present in the client equipment.

The protection, realized in a manner in conformity with the invention, is based on the principle of the suppression and/or replacement of information describing the video signal by any method, e.g., substitution, modification or shifting of the information. This protection is also based on a knowledge of the structure of the stream at the output of the video encoder: The scrambling is a function of the content of said digital video stream. The reconstitution of the original stream is performed on the recipient equipment from the modified main stream already present or received in real time in the recipient equipment and from the complementary information transmitted in real time at the moment of display comprising data and functions executed with the aid of digital routines (set of instructions).

The present invention concerns in particular the synchronization process between the supplier of the complementary information (the server) and the reader/display device installed at the client. To this end the invention uses synchronization elements (or position identifiers of the reading head in the stream available on the recipient equipment) that permit the connection to be made between a given portion of the complementary information and the part of the modified main stream that it allows to be modified in order to reconstitute the corresponding part of the nominal stream. For example, the invention makes use of temporal position data relative to the reading head of the client in the video stream available on the client equipment like that contained in a MPEG stream in order to determine the complementary information to be translated.

Another possibility, corresponding to another exemplary embodiment, is to number the pictures of the modified main stream and to indicate in the different portions of the complementary information the number of the picture or pictures of the modified main stream that said portion allows to be modified in order to reconstitute the corresponding part of the nominal stream.

Another exemplary embodiment consists in utilizing binary words calculated from the modified main stream in that a given binary word is specific to a given portion of the modified main stream, that is then specified in the corresponding portion of the complementary information.

Another exemplary embodiment consists in utilizing binary words calculated from the modified main stream in that a given binary word is specific to a given picture of the modified main stream, that is then specified in the corresponding portion of the complementary information.

Another exemplary embodiment consists in adding a single [unique] binary word to each user field of the modified main stream (field “user data” of MPEG-2, for example) and to the corresponding portion of the complementary information. A user field is characterized in that binary information can be added to it without affecting the displaying of the binary video stream containing it.

Whatever the solution selected, the client regularly transmits to the server the position identifier of the reading head of the client in the stream available on the recipient equipment (temporal position or binary word) permitting the served to determine the portion of the complementary information which the client equipment requires for transforming the modified main stream in order to reconstitute the corresponding part of the nominal stream.

The invention will be better understood from a reading of an exemplary embodiment of the invention that makes reference to FIGS. 1 and 2.

In this exemplary embodiment the invention relates to a video sequence encoded in the MPEG-2 format without this constituting a reduction of the scope of the present invention.

In FIG. 1 the arrangement of the video interfacing 8 is adapted in such a manner as to connect at least one display device, e.g., a monitor, a video projector or a device of the television screen type 6 to at least one interface of a broadband transmission and broadcasting network 4 and to at least one telecommunication network interface 10. According to the present invention this arrangement is composed of module 8 comprising primarily, on the one hand, a processing unit adapted to process and in particular to decode and descramble any video stream of the MPEG-2 type according to a pre-loaded software program of decoding and descrambling in such a manner as to display it in real or deferred time, store it, register it and/or transmit it via a telecommunication network, and comprising, on the other hand, at least one screen interface 7 and one interface for connection to a local or wide-area network 5 and/or 9. Broadband transmission and broadcasting network 4 and telecommunication network can be combined in a single network.

The hard disk or the recording device of module 8 can be used as buffer memory for temporarily storing at least a part of the program or of the video sequence to be displayed in the case of a deferred display or of a limitation in a broadband of the transmission network. The display can be delayed or deferred upon the request of the user or of portal 12.

As FIG. 1 shows, connection interface 5 is connected to broadband transmission and broadcasting network 4 such as a modem, satellite modem, cable modem, a fiber optic line interface or a radio or infrared interface for wireless communication.

The content of audiovisual programs such as films will be transmitted via this classic video broadcasting connection. However, in order to prevent pirated copies, before transmitting the audiovisual content from server 1 or portal 12, a small part of the audiovisual content is retained in portal 12.

In the case of a displaying of an audiovisual program in real time, this small part of the audiovisual content retained in portal 12 will also be transmitted to module 8 in real time via telecommunication network 10.

In the MPEG-2 format the audiovisual stream is divided into a hierarchy of structures nested into each other. Thus, a “stream” containing an indefinite number of groups of pictures connected to each other (GOP: Group of Pictures); a group of pictures containing a certain number of pictures (generally 12 or 15 for MPEG, but this is not obligatory); a picture is decomposed into slices; a slice contains a series of macroblocks; a macroblock regroups between 6 and 12 blocks; a block contains the information relative to a square of 8×8 pixels in the forms of frequency coefficients. The stream is compressed by a direct cosine transformation (DCT) that is applied to each block in such a manner as to concentrate the pertinent information in only certain coefficients in such a manner as to be able to suppress the others and thus reduce the quantity of information to be stored, and at a sampling of transformed coefficients and a coding intended to reduce the size of the stream (e.g., variable-length coding or a coding of the run level type.

In this embodiment described relative to FIGS. 1 and 2, nominal stream 101 comes from server 1 in order to be transmitted to portal 12. Analyzing device 121 of portal 12 proceeds to an analysis of nominal stream 101 in order to constitute on the one hand the modified main stream 122 and on the other hand the complementary information 123. Modified main stream 122 is transmitted to the client in any manner. This manner can be: Via broadband network 4 of the BLR or DSL type, via a mobile network of the GSM type, or even by means of a CD-ROM or another physical support. The client stores modified main stream 122 on a physical support 85 situated at his location, which physical support can be a hard disk or a CD-ROM.

When client 8 wishes to view the video sequence corresponding to this modified main stream he sends the request to server 12, specifying an identifier of the video sequence requested and supplying at least one identifier of the client 8. Server 12 receives the request from the client and verifies whether he has the right to view the requested sequence using the identifier of the sequence and that of the client. This verification can be made, e.g., with the aid of a database cataloging the list of authorized video sequences for each client.

If client 8 is authorized to view the requested video sequence, server 12 establishes a connection with the client in order to transmit complementary information 123 via network 10. When the connection has been established, client 8 transmits to server 12 the identifiers of the position of the reading head in the requested video stream. Server 12 receives the information about the position and adapts the content of complementary information 123 transmitted to the client via connection 10 as a function of this position.

In another embodiment of this invention the positional information is transmitted at the same time as the request for the video sequence and the server begins the broadcasting of the complementary information corresponding to this position after having verified the authorization.

Client 8 receives the part of complementary information 123 corresponding to the position of the reading head and stores it in input buffer 86. This buffer is preferably a volatile memory. At the same time the client reads the modified main stream corresponding to this position from storage support 85 via reading buffer 83. Synthesis device 87 uses the complementary information stored in input buffer 86 and the main stream stored in reading buffer 83 for reconstituting the original stream without error and transmitting it to reader 81. The original stream read by reader 81 is then displayed on display device 6.

In this exemplary embodiment two variables for each picture of the stream are used as position identifiers: The temporal reference variable present in the picture header field and the time code variable present in the Group of Pictures Header field for the group of pictures in which group the picture under consideration is located. These variables allow a picture in an MPEG-2 video stream with a total duration of less than 24 h to be identified in a unique manner. The client regularly transmits his position in the video stream to the server by communicating these two pieces of data to it. The server adapts the portion of the complementary information to be transmitted as a function of this position. In fact, each portion of the complementary information contains a copy of these position identifiers or synchronization elements that permit a unique connection to be made between the picture of the modified main stream and the corresponding portion of the complementary information that permits the modified main stream to be modified in order to reconstitute the corresponding part of the nominal stream. In this exemplary embodiment the complementary information is contained in a single file. When server 12 receives the positional variables coming from the client it determines the portion of complementary information to be transmitted by browsing [scanning] this file. In order to make the searching for the desired portion more rapid, the invention can advantageously make use of a table that makes a position in this file correspond to a picture of the scrambled main stream, which table is realized during a previous phase associated with the analysis of the main stream.

In an alternative embodiment the synchronization element or position identifier of the reading head in the stream available on the recipient equipment is the number of the current picture, that is to say, the order of appearance of this picture within the corresponding modified main stream. For example, the first picture of the modified main stream has the number 1, the second picture the number 2 and the 22nd picture the number 22. This number is also indicated in the complementary information in such a manner as to be capable of establishing the connection between the picture of a modified main stream and the portion of the corresponding complementary information that permits the modified main stream to be transformed in order to reconstitute the corresponding part of the nominal stream.

In a third exemplary embodiment the synchronization elements or position identifiers of the reading head in the stream available on the recipient equipment are binary words calculated from the modified main stream and one given binary word is specific to the content of a given portion of the modified main stream, which portion is characterized by its binary position defined by the number of bits separating it from the beginning of the stream, and its size, which binary word is specified in the portion of the corresponding complementary information. This binary word is calculated according to the binary content of this portion in such a manner that two different referenced portions produce different binary words. In order to obtain the binary word a table of hash code can be used. A hash code table is a set of inputs, each of which is constituted by one key and one value. It is not possible to have two inputs with the same key. From a key, a hash code table can very rapidly find the corresponding input. There are numerous algorithms of this type that are especially used in telecommunications for detecting transmission errors.

In a variant of the previous example this binary word is not calculated according to the binary content of a portion of the modified main stream characterized by its binary position and its size but rather according to the binary content of a given picture of the modified main stream.

In another exemplary embodiment the synchronization elements or position identifiers of the reading head in the stream available on the recipient equipment are binary words inserted into each user fields of the modified main stream (user data field of MPEG/2 that can be inserted before each picture of an MPEG-2 video stream) and into the portion of corresponding complementary information. Each binary word is different in such a manner that the connection between a picture of the modified main stream and a portion of the complementary information is unique.

Whatever the manner of constitution of these position identifiers of the reading head, in a normal and continuous operation of displaying the video stream (simple reading) client 8 transmits information to the server corresponding to the last picture or to the last displayed portion of the stream available on the recipient equipment that can be issued from the modified main stream if the client did not receive the corresponding complementary information for modifying the main stream in order to reconstitute the corresponding part of the nominal stream, or issued from the reconstituted stream identical to the nominal stream in the contrary case. The server receives this positional information and calculates the portion of complementary information to be transmitted in order to permit the viewing of the content in accordance with this last displayed position. In the embodiment of interest for us this calculation is possible due to the correspondence between the temporal reference couples (temporal code present in the messages transmitted by the client to the server) and the couples of the same nature described in the complementary information. In normal operation the position transmitted by client 8 corresponds to a portion of complementary information recently transmitted by server 12. The latter thus transmits the following portion of the complementary information to client 8 by network 10. This portion corresponds in fact to pictures that the client will soon display on display device 6 in order to allow synthesis device 87 time to descramble the stream. If the server transmits the complementary information corresponding to what the client is currently displaying, the latter would arrive too late at the client's to be able to be used and it is therefore necessary to transmit it with a slight anticipation. Server 12 transmits the complementary information by packets, each comprising the information necessary to reconstitute several pictures. If each packet corresponds to a duration t of the video sequence and the transmission of the packet occupies a duration t′, server 12 waits a duration t-t′ between the end of the transmission of a packet and the start of the transmission of the following packet. At the end of this wait the server wakes up and analyses the messages coming from the client. These messages then condition the behavior of the server in the manner described above.

Server 12 transmits portions of complementary information as long as it receives messages from client 8 giving it the position of the reading head. When the client stops the reading (a pause or stop or network disconnection) it stops transmitting positional information to the server. In this realized example if the server does not receive messages from the client equipment during the time necessary for the latter to view the video stream corresponding to the last packet of complementary information transmitted (duration t), that is to say, if it is not receive new messages when it has been waked up, it stops transmitting portions of complementary information.

The transmission of portions of complementary information reassumes when the client recommences reading the video sequence and thus recommences transmitting positional information. Thus, in the case of a network failure preventing communication between the client equipment and the server, when the failure occurs the server ceases to receive position identifiers of the reading head and therefore stops transmitting the complementary information; and when the failure ceases the client equipment transmits its current position in the stream available on the client equipment, the server receives it, adapts to it and transmits the corresponding complementary information. Likewise, in the case of a pause or a halting of the viewing of the video stream by the client, the server no longer receives messages from the client and therefore stops transmitting the complementary information. The latter reassumes when the client starts reading the video stream again that is available on the client equipment.

In the case in which the client desires to stop viewing the video stream for a prolonged duration, the client-server session is advantageously closed. When the session recommences following the desire of the client to start viewing the video stream again and following his reconnection to the server, the server transmits the complementary information from the portion corresponding to that which it received before the end of the session. The displaying of the video stream of the client equipment does not recommence until at the moment at which it begins to receive the complementary information from the server.

If the client goes back in the video sequence, the new position transmitted to the server is a position prior to the last position transmitted. The following transmission of the server is therefore a part of the complementary information situated before the last part transmitted. The quantity and therefore the duration of the complementary information transmitted by the served is a function of the reverse [back] speed that was selected by the client. This functionality permits several reverse return speeds to be offered on the client equipment.

Likewise, if the client transmits a command “rapid advance” in the video sequence, the new position transmitted to the server is subsequent to the position “expected” by the server, that is, the position corresponding to the last part of the complementary information transmitted by the server to the client. The following transmission of the server is therefore a part of the complementary information situated after the last part transmitted. The quantity of the complementary information transmitted by the server is a function of the “rapid advance” speed that was selected by the client. This functionality permits several “rapid advance” speeds to be offered on the client equipment.

In order to improve the synchronization between the server and the client the invention also comprises a mechanism for acknowledging the reception. The portion of complementary information transmitted by server 12 is stored in entrance buffer 86. If synthesis device 87 requires this complementary information for reconstituting the original stream and if this occurs, client 8 transmits a confirmation message for specifying to server 12 whether it received the complementary information and rather it was able to use it for the display. If the client was not able to use the complementary information this signifies to the server that this information arrived too late (after the moment at which it should have been used) and therefore that the client and the server are out of sync. In this instance input buffer 86 is empty and server 12 adapts the information stream that it is transmitting in such a manner as to refill this buffer. To this end server 12 must anticipate the reading of client 8 longer. It then has two solutions:

-   -   Either it increases the number of pictures in the next packet.         This solution allows the continuity of the displayed stream to         be retained but necessitates a network 10 sufficiently large to         support a momentary increase of the traffic;     -   Or, it selects a portion of complementary information subsequent         to that which it should have transmitted to assure the         continuity of the video stream. Then, the pictures for which no         complementary information is transmitted remain scrambled.

In another exemplary embodiment the network protocol used for the communications between the client and the server is UDP (User Datagram Protocol).

Finally, another embodiment is described in the following that concerns the synchronization in a protection system applied to audiovisual streams in the MPEG-2 TS (Transport Stream) format, defined by the MPEG-2 standard for a broadcasting of data that is robust re transmission errors on the networks. Each of the audio or video tracks contained in the stream is decomposed into a series of packets of 188 bytes called TS packets. Each TS packet contains a header indicating to which audio or video track the packet is attached, the order of processing the packets and the synchronization information for the associated audio and video tracks.

During the scrambling stage some of the TS packets relative to the video streams are substituted by “decoy” packets in conformity with the standard in order to visually degrade the video stream. Each MPEG-2 packet in the stream is identified in a unique manner in order to correctly synchronize the modified main stream and the complementary information and to reinsert the original packets into the stream during the descrambling phase.

The operation of descrambling the stream protected by the substitution of TS packets is simple and efficacious. The descrambling module utilizes information relative to the MPEG-2 encapsulation of data, called “identifiers” without utilizing the data relative to the video content for the synchronization.

These identifiers utilized for the synchronization are:

-   -   PID (Program Identity) of the substituted packet;     -   The continuity counter of the substituted packet;         -   The last PCR (Program Clock Reference) encountered relative             to the video stream concerned;     -   The occurrence index of this continuity counter from the last         MPEG-2 packet including a PCR. As this index is not present in         the header of the TS packets, it is calculated during the         scrambling and descrambling phases.

The “program identifier” of a TS packets is located in the header of each TS packet, allowing an MPEG-2 decoder to associate all the TS packets relative to one in the same stream during the demultiplexing.

The continuity counter of a TS packet is a cyclical counter varying between 0 and N−1 and permitting the packets to be put back in order in the case of a permutation [shifting] or loss of packets due to a network transmission error in a group of N consecutive packets.

The program clock reference is an optional binary field and allows the decoder to calculate a time base.

The occurrence index of the continuity counter not present in the TS packets is calculated by analysis and scrambling module 12. This occurrence index corresponds for a given packet to the number of TS packets having the same program clock reference and the same continuity counter that followed one another since the last program clock reference field. The program clock reference is relative to the program identifier of the packet. Consequently, a TS packet containing a program clock reference will necessarily have an occurrence index of 1. The TS packet of the same program identifier and of the same continuity counter that follows will therefore have an occurrence index of 2 (if no program clock reference slipped in for this stream in the meantime). The occurrence indices of the following packets are incremented by 1 if their continuity counter has turned through one cycle (N) and their program identifier is identical until a new program clock reference relative to the same stream is encountered.

When the scrambling module backs up in the complementary information the original TS packets that were substituted, it systematically associates the four identifiers previously cited with them:

-   -   The program identifier of the packet allows it to be known, on         the client side, to which elementary stream the packet belongs.     -   The last program clock reference encountered permits the         situating of the temporal slice (with a granularity of 100 ms)         to which the packet belongs.     -   The occurrence index of the continuity counter permits the         identification of a group of N TS packets to which the packet         belongs.     -   The continuity counter permits the identification of exactly         which packet is being referred to within this group of N         packets.

These four identifiers are used for the synchronization during the descrambling phase and the occurrence index of the continuity counter is also recalculated during the descrambling phase.

For example, in a solution of broadcasting MPEG-2 TS streams protected in real time, when client 8 wishes to display the stream the server transmits part of the complementary information containing the original TS packets in advance, including associated TS synchronization information. When descrambling module 87 receives a packet of complementary information it realizes with the aid of synchronization identifiers the correspondence with the packets of the modified main stream and substitutes them with the original packets present in the complementary information.

Within the framework of an application of video on demand the descrambling module regularly transmits to server 12 the synchronization identifiers from the packets of the modified main stream that is in the course of being descrambled and displayed on viewing screen 6. In this manner server 12 deduces from them the portion of complementary information which the client will require in the coming instants and sends it the necessary packets of complementary information. 

1-19. (canceled)
 20. A process for distributing video sequences in accordance with a nominal stream format including a succession of images/pictures, which nominal stream, on which an analysis is made prior to transmission to client equipment for generating a modified main stream, has a format of the nominal stream and has images/pictures modified by substitution of selected data by similar data, but random or calculated, and has complementary information of any format including substituted data and digital information suitable for permitting reconstruction of a modified nominal stream, and then for separately transmitting the modified main stream in real time or deferred time and the complementary information in real time at substantially the moment of display from a server to the client equipment, and for which a synthesis of an available stream in a nominal format is calculated on the client equipment, reconstituted as a function of the modified main stream and of the complementary information and a reading of the stream on the client equipment, wherein during reading of the stream a stage includes generating a position identifier as a function of characteristics of the stream, which position identifier is transmitted to the server that activates in response sending the complementary information as a function of the position identifier.
 21. The process according to claim 20, wherein each picture of the nominal stream is associated with a position identifier.
 22. The process according to claim 20, wherein the reading stage comprises calculating the position identifier of the picture read.
 23. The process according to claim 20, wherein the reading stage comprises calculating the position identifier of the stream read.
 24. The process according to claim 20, wherein the complementary information is transmitted in portions.
 25. The process according to claim 20, wherein the available stream on the recipient equipment of which the reading conditions the position and the portion to be sent of the complementary information is a part of the modified main stream.
 26. The process according to claim 20, wherein the available stream on the recipient equipment of which the reading conditions the position and the portion to be sent of the complementary information is a part of the reconstituted nominal stream.
 27. The process according to claim 20, wherein the format of the nominal stream is defined by the MPEG-2 standard.
 28. The process according to claim 27, wherein the position identifier for an image/picture includes a time code variable associated with a group of images/pictures in which the image/picture under consideration is located and the temporal reference variable of the image/picture.
 29. The process according to claim 20, wherein the format of the nominal stream is in the MPEG-2 TS format and the position identifier includes four variables identifying the program, program clock reference, continuity counter and occurrence index of the continuity counter, which occurrence index of the continuity counter results from a calculation applied to the TS packets.
 30. The process according to claim 20, wherein each portion of the complementary information sent by the server permits reconstitution of at least one image/picture of the original stream during the synthesis.
 31. The process according to claim 20, wherein the server adapts the size and content of each portion of the complementary information to be sent as a function of the position identifier.
 32. The process according to claim 20, wherein each portion of the complementary information is sent in advance relative to an instant of display of this picture of the steam reconstituted with the portion.
 33. The process according to claim 20, wherein the sender adapts sending complementary information when the user of the recipient equipment pauses, thereby stopping the transmission of complementary information.
 34. The process according to claim 20, wherein the server adapts sending complementary information when the user of the recipient equipment makes a rapid advance or a rapid return by sending a portion corresponding to a proper position for the commands “rapid advance” and “rapid return.”
 35. The process according to claim 20, wherein the server adapts transmission of complementary information when a network breakdown occurs that prevents client-server communication by stopping transmission of complementary information during the breakdown and restarting it when the breakdown stops and it again receives messages coming from the client.
 36. The process according to claim 20, wherein prior to transmitting the complementary information, the server creates a table associating the pointers to the portion of the complementary information with temporal positions relative to images/pictures of the video stream, stores the table on a support connected to the server and consults the table to determine a portion of complementary information to be transmitted after having received the position identifier.
 37. Equipment for producing a video stream comprising at least one multimedia server containing original video sequences, a device for analyzing the video stream coming from the server for generating a modified main stream and complementary information, a device for synchronizing transmission of the complementary information as a function of a position identifier transmitted by recipient equipment.
 38. A system for transmitting a video stream according to the process of claim 20, comprising equipment for producing a video stream, at least one piece of equipment for using a video stream and at least one communication network between the production equipment and the piece or pieces of equipment. 